
    Investigation into the annotation of protocol sequencing steps in the sequence read archive

    BACKGROUND: The workflow for producing high-throughput sequencing data from nucleic acid samples is complex, and preparing samples for next-generation sequencing involves a series of protocol steps. The bias introduced by several of these steps, namely DNA fractionation, blunting, phosphorylation, adapter ligation and library enrichment, remains to be quantified. RESULTS: We examined the experimental metadata of the public repository Sequence Read Archive (SRA) to ascertain the level of annotation of important sequencing steps in submissions to the database. We ran SQL queries against the SRAdb SQLite database generated by the Bioconductor consortium, searching for keywords that commonly occur in key preparatory protocol steps, partitioned over studies. We found that 7.10%, 5.84% and 7.57% of all records (fragmentation, ligation and enrichment, respectively) had at least one keyword corresponding to one of the three protocol steps. Only 4.06% of all records, partitioned over studies, had keywords for all three steps in the protocol (5.58% of all SRA records). CONCLUSIONS: The current level of annotation in the SRA inhibits systematic studies of bias due to these protocol steps. As a consequence, meta-analyses and comparative studies based on these data will carry a source of bias that cannot be quantified at present.
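The keyword search described in the RESULTS can be sketched roughly as follows. This is a minimal illustration rather than the authors' actual queries. The keyword lists are hypothetical. The table name `sra` and the column `library_construction_protocol` are assumptions about the SRAdb schema. A small in-memory table stands in for the real database, and the sketch counts individual records without reproducing the per-study partitioning mentioned in the abstract.

```python
import sqlite3

# Hypothetical keyword lists; the abstract does not state the keywords used.
KEYWORDS = {
    "fragmentation": ["fragment", "sonicat", "shear"],
    "ligation": ["ligat"],
    "enrichment": ["enrich", "pcr"],
}

# Assumed column holding the free-text protocol description.
PROTOCOL_COLUMN = "library_construction_protocol"


def build_toy_db():
    """Create an in-memory stand-in for the SRAdb SQLite file (toy data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sra (experiment_accession TEXT, study_accession TEXT, "
        "library_construction_protocol TEXT)"
    )
    conn.executemany(
        "INSERT INTO sra VALUES (?, ?, ?)",
        [
            ("X1", "S1", "Fragmented by sonication, adapters ligated, PCR enriched"),
            ("X2", "S1", "DNA was sheared"),
            ("X3", "S2", None),
            ("X4", "S2", "standard protocol"),
        ],
    )
    return conn


def step_fractions(conn):
    """Return the fraction of records matching each step, and all three steps."""
    total = conn.execute("SELECT COUNT(*) FROM sra").fetchone()[0]
    clauses, params = {}, {}
    for step, words in KEYWORDS.items():
        # One LIKE test per keyword; SQLite's LIKE ignores ASCII case by default.
        clauses[step] = "(" + " OR ".join(f"{PROTOCOL_COLUMN} LIKE ?" for _ in words) + ")"
        params[step] = [f"%{w}%" for w in words]

    result = {}
    for step in KEYWORDS:
        query = f"SELECT COUNT(*) FROM sra WHERE {clauses[step]}"
        result[step] = conn.execute(query, params[step]).fetchone()[0] / total

    # A record counts as fully annotated only if every step has a keyword hit.
    all_clause = " AND ".join(clauses.values())
    all_params = [p for step in KEYWORDS for p in params[step]]
    query = f"SELECT COUNT(*) FROM sra WHERE {all_clause}"
    result["all_three"] = conn.execute(query, all_params).fetchone()[0] / total
    return result


if __name__ == "__main__":
    print(step_fractions(build_toy_db()))
```

Records with a NULL protocol field never match a `LIKE` test, so they count toward the total but not toward any step, which reflects the idea of measuring how many records are annotated at all.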

    Genomic landscape and clonal architecture of mouse oral squamous cell carcinomas dictate tumour ecology.

    To establish whether 4-nitroquinoline N-oxide-induced carcinogenesis mirrors the heterogeneity of human oral squamous cell carcinoma (OSCC), we performed genomic analysis of mouse tongue lesions. The mutational signatures of human and mouse OSCC overlap extensively. Mutational burden is higher in moderate dysplasias and invasive SCCs than in hyperplasias and mild dysplasias, although mutations in p53, Notch1 and Fat1 occur in early lesions. Laminin-α3 mutations are associated with tumour invasiveness, and Notch1 mutant tumours have an increased immune infiltrate. Computational modelling of clonal dynamics indicates that high genetic heterogeneity may be a feature of those mild dysplasias that are likely to progress to more aggressive tumours. These studies provide a foundation for exploring OSCC evolution, heterogeneity and progression.